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O Abstract 

Regenerating codes are a class of recently developed codes for distributed storage that, like Reed-Solomon 
codes, permit data recovery from any subset of k nodes within the n-node network. However, regenerating codes 
possess in addition, the ability to repair a failed node by connecting to an arbitrary subset of d nodes. It has 
been shown that for the case of functional-repair, there is a tradeoff between the amount of data stored per node 
and the bandwidth required to repair a failed node. A special case of functional-repair is exact-repair where the 
replacement node is required to store data identical to that in the failed node. Exact-repair is of interest as it 
greatly simplifies system implementation. 

<N 

t> The first result of the paper is an explicit, exact-repair code for the point on the storage-bandwidth tradeoff 

corresponding to the minimum possible repair bandwidth, for the case when d = n — 1. This code has a 
particularly simple graphical description and most interestingly, has the ability to carry out exact-repair through 
mere transfer of data and without any need to perform arithmetic operations. Hence the term repair-by -transfer. 

The second result of this paper shows that the interior points on the storage-bandwidth tradeoff cannot be 
achieved under exact-repair, thus pointing to the existence of a separate tradeoff under exact-repair. Specifically, 
we identify a set of scenarios, termed 'helper node pooling', and show that it is the necessity to satisfy such 
scenarios that over-constrains the system. 



Index Terms 

Distributed storage, node repair, regenerating codes, minimum bandwidth, storage-repair bandwidth tradeoff. 



Nihar B. Shah, K. V. Rashmi and P. Vijay Kumar are with the Dept. of ECE, Indian Institute Of Science, Bangalore, India. Email: {nihar, 
rashmikv, vijay} Oece.iisc.ernet.in. P. Vijay Kumar is also an adjunct faculty member of the Electrical Engineering Systems Department 
at the University of Southern California, Los Angeles, CA 90089-2565. Kannan Ramchandran is with the Dept. of EECS, University of 
California, Berkeley, USA. Email: kannanr@eecs.berkeley.edu. 

Preliminary versions of this paper were presented at Allerton Conf., Urbana-Champaign, Sep. 2009, ITA, San Diego, Feb. 2010, and 
ISIT, Austin, Jun. 2010. 



2 



I. Introduction 

In a distributed storage system, information pertaining to a data file is dispersed across nodes in 
a network in such a manner that an end-user (termed as the data-collector or DC) can retrieve the 
data stored by tapping into a subset of nodes in the network. A popular option that reduces network 
congestion and leads to increased resiliency in the face of node failures, is to employ erasure coding, 
for example by calling upon maximum-distance- separable (MDS) codes such as Reed-Solomon (RS) 
codes. 

Let the data stored in the network be represented by a collection of B message symbols, with each 
message symbol drawn from a finite field ¥ q of size q. With RS codes, data is stored across n nodes 
in the network in such a way that the entire data can be recovered by a data-collector by connecting to 
any arbitrary k = B nodes, a process of data recovery that we will refer to as reconstruction. Several 
distributed storage systems such as RAID-6 [[TJ, OceanStore |2| and Total Recall [3] employ such an 
erasure-coding option. 

Upon failure of an individual node, a self-sustaining data storage network must necessarily possess 
the ability to regenerate (i.e., repair) the failed node. An obvious means of accomplishing this task is 
by first permitting the replacement node to download the entire data stored in any k nodes and then 
proceeding to extract the data that was stored in the failed node. Such a procedure is indeed mandated 
when RS codes are employed to distribute the data and nodes are restricted to carry out linear operations 
on the data stored within them. 

RS codes treat the data stored in each node as a single symbol belonging to the finite field ¥ q . When 
this is coupled with the restriction that individual nodes perform linear operations over F 9 , it follows 
that the smallest unit of data that can be downloaded from a node assisting in the repair of a failed 
node equals the amount of information stored in the node itself, i.e., is the equivalent of an ¥ q symbol. 
As a consequence of the MDS property of an RS code, when carrying out repair of a failed node, the 
replacement node must necessarily collect data from at least k other nodes. As a result, it follows that 
the total amount of data download needed to repair a failed node can be no smaller than B, the size of 
the entire message. 

However, downloading the entire message of size B in order to repair a single node that stores only a 
fraction of the entire data is wasteful, and raises the question as to whether there is a better alternative. 
Such an alternative is provided by the concept of a regenerating code introduced in the pioneering paper 
by Dimakis et al. [(4)- 

A. Regenerating Codes 

In the regeneration framework introduced in [4], codes whose symbol alphabet is a vector over ¥ q , 
i.e., an element of for some parameter a > 1 are employed. As with RS codes, each node still 
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Fig. 1: The regenerating codes setup: (a) data-reconstruction, and (b) repair of a failed node. 



continues to store a single code symbol. However, given the vector nature of the code-symbol alphabet, 
we may equivalently, regard each node as storing a collection of a symbols, each symbol drawn from 
¥ q . Under this setup, it is clear that while maintaining linearity over ¥ q , it is possible for an individual 
node to transfer a fraction of the data stored within the node. 

Apart from this new parameter a, two other parameters (d, (3) are associated with the regenerating 
framework introduced in [|4j. A replacement node is permitted to connect to an arbitrary subset of d 
nodes out of the remaining (n — 1) nodes while downloading (3 < a symbols from each node. The d 
nodes helping in the repair of a failed node are termed as helper nodes. The total amount d(3 of data 
downloaded for repair purposes is termed the repair bandwidth. Thus under this framework, apart from 
the field size q, we have 

{ [n, k, d), (a, (3, B) } 

as the parameter set. The corresponding codes are called regenerating codes. Typically, with a regenerat- 



ing code, the average repair bandwidth d(3 is small compared to the size of the file B. Fig. la and Fig. lb 



illustrate reconstruction and node repair respectively, while also depicting the relevant parameters. 

Note that the parameters k and d are the minimum values under which reconstruction and repair can 
always be guaranteed. This restricts the range of d to 

fc<d<n-l, (1) 



for, if the repair parameter d were less than the reconstruction parameter k, this would imply that one 
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could in fact reconstruct the data by connecting to d nodes, thereby contradicting the minimality of k. 

B. Exact versus Functional Repair 

Under the notion of functional repair introduced in [4], a failed node is replaced by a node that is 
functionally equivalent, i.e., following replacement, the resulting network of n nodes must continue to 
possess the reconstruction and repair properties. In contrast, under exact-repair, introduced subsequently 
in (7), [[8j, a replacement node is required to store exactly the same data as was stored in the failed 
node. Hence, there is no change in the coefficients of a replaced node under exact-repair. This obviates 
additional communication overheads during the repair operation, and also avoids re-tuning of the 
reconstruction and repair algorithms. Thus exact-repair greatly simplifies system implementation and is 
of considerable practical interest. 

We use the term exact-repair code to denote a regenerating code that is the capable of performing 
exact-repair of any failed node. 

C. The Storage-Repair Bandwidth Tradeoff 

A major result in the field of regenerating codes is the proof in pj that uses the cut-set bound of 
network coding to establish that the parameters of a regenerating code must necessarily satisfy [j] 

fc-i 

B < ^min{a, (d - (2) 

i=0 

It is desirable to minimize both a as well as (3 since, minimizing a results in a minimum storage 
solution, while minimizing (3 results in a storage solution that minimizes repair bandwidth. As can be 
deduced from ([2]), it is not possible to minimize both a and (3 simultaneously and thus there exists a 
tradeoff between the choices of the parameters a and (3 and this tradeoff is termed as the Storage-Repair 
Bandwidth Tradeoff, or more simply as the storage-bandwidth tradeoff. In Fig. [2j the tradeoff is plotted 
for B = 27000 symbols for a system with k = 10, d = 18 and some n > 18. 

For fixed values of B and [n, k, d], a regenerating code is said to be optimal, if the parameters 
(a, (3) are such that: 

1) equality holds in equation (|2]), i.e., 

k-l 

B = ^min{a, {d-i)/3}, (3) 

i=0 



This bound on the message size B is originally derived in [5] using the principles of network coding. An information-theoretic 
derivation is presented in Section IV-B in the present paper. 
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Fig. 2: The storage -bandwidth tradeoff curve: storage space a vs repair bandwidth d/3. Also depicted 
are the two end points MBR and MSR, and two interior points IPx and IP 2 . 



and 

2) if either a or (3 is decreased, equation ([3]) fails to hold. 
The parameters (a, (3) of any optimal regenerating code are said to lie on the storage-bandwidth tradeoff. 

Observe that when a < (d — (k — l))/3, the parameter (3 can be decreased without violating ([3]). 
Hence the parameters of an optimal regenerating code must necessarily satisfy 



Thus, the case of 



a > (d-k + l)/3. 
a= (d- k + l)/3 



(4) 
(5) 



is one of the extreme points of the tradeoff called the Minimum Storage Regenerating (MSR) point. 
From equation Q, we see that the parameters at the MSR point satisfy 



B = ka. 



(6) 



On the other hand, if /3 < ^ (i.e., if a > d/3), the parameter a can be decreased without violating ([3]). 
Hence the parameters of an optimal regenerating code must necessarily satisfy 

a 



The case when f3 = ^ i.e., 



a = d(3 



(7) 
(8) 
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is the other extreme point of the tradeoff called the Minimum Bandwidth Regenerating (MBR) point. 
From equation (|3]) we see that the parameters at the MBR point satisfy 



Points on the tradeoff other than the two extreme (MSR and MBR) points, have the parameter a lying 
strictly within this range: (d — k + l)/3 < a < d/3. These points are hence referred to as the interior 
points on the tradeoff curve. 

Note that the situation when k = 1 results in B = a = d/3 which can be satisfied trivially by a 
repetition code. Thus, we assume k > 1 through the rest of the paper. 

The storage-bandwidth tradeoff was derived in J5J for the case of functional repair and was shown 
to be tight in (5J, [|6j. Clearly, the bound continues to hold even under the exact-repair setting, since 
exact-repair is an instance of a functional-repair. However, the achievability of this bound under an 
exact-repair requirement has remained an open problem, and will be addressed in the present paper. 

D. Summary of the Results in This Paper 

The first main result of the present paper is an explicit construction of exact-MBR codes for the case 
d = n — 1, that has a simple graphical description. An interesting, and potentially useful, aspect of 
this construction is its repair-by -transfer property where a failed node is repaired by simple transfer of 
data without need for any computation at either the helper nodes or the replacement node. Furthermore, 
when specialized to the parameter set [n, k = n — 2, d — n — 1], encoding of the code as well as 
data-reconstruction can be accomplished using XOR operations alone. These properties give the code 
practical appeal [^J 

The second main result of the present paper answers an open problem regarding the achievability 
of the storage-bandwidth tradeoff under exact-repair, at the interior points. First, a set of properties 
required to be satisfied by an exact-repair code are derived; in particular, a set of constraining scenarios, 
which we call 'helper node pooling' scenarios, are identified. Subsequently, the non- achievability of 
the interior points on the storage-bandwidth tradeoff under exact-repair is established, with the possible 
exception of points within the immediate vicinity of the MSR point. 

2 An animated video of an example of this code is available in |24|. 




(9) 
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E. Organization 

The paper is organized as follows. A brief overview of related literature is provided in Section |ll| 
Section lin] contains the exact-MBR code construction. A set of properties that any exact-repair code must 



necessarily satisfy are provided in Section IV which are then used to establish the non-achievability 
of the storage-bandwidth tradeoff for exact-repair at nearly all interior points. Section [V] presents 
conclusions. 



II. Related Work 

The concept of regenerating codes, introduced in Q, [|5j, permit storage nodes to store more than 
the minimal B/k units of data in order to reduce the repair bandwidth. Several distributed systems are 
analyzed, and estimates of the mean node availability in such systems are obtained. Using these values, 
the substantial performance gains offered by regenerating codes in terms of bandwidth savings are 
demonstrated. The problem of minimizing repair bandwidth for functional repair of nodes is formulated 
as a multicast network-coding problem in a network having an infinite number of nodes. A cut-set 
lower bound on the repair bandwidth is derived. Coding schemes achieving this bound are presented 
in [[5j, [[6) which however are non-explicit. These schemes require large field size and the repair and 
reconstruction algorithms are also of high complexity. 

The notion of exact-repair is introduced independently in (7J and [[8j. In (jTJ, the MSR point is shown 
to be achievable under exact-repair for the parameters [n, k = 2, d — n — 1]. The proposed coding 
scheme uses the concept of interference alignment. Even here, the constructions are not explicit, and 
have large complexity and field- size requirement. 

The first explicit construction of regenerating codes appears in (8). An explicit MSR code is con- 



structed here, for [n, k, d = k + 1] (see also [11 1). A computer search for exact-repair MSR codes for 
the parameter set [n = 5, k = 3, d = 4] is carried out in ||9j, and for this set of parameters, codes for 
several values of field size are obtained. 



A slightly different setting, from the exact-repair situation is considered in [13|, where optimal MDS 
codes are given for the parameters [n > 2k, k, d = k + 1]. Again, the schemes given here are 
non-explicit, and have high complexity and large field-size requirement. 

An explicit code structure at the MSR point, that guarantees reconstruction and exact-repair of the 



systematic nodes is provided in [10|, for parameters [n > 2k, k, d = n — 1]. This code makes use of 



interference alignment, and is termed as the 'MISER' code in the journal-submission version [11| of 



T0|. Following the initial submission of piJ|, it is shown in JT2J that for this set of parameters, the 



code introduced in [10] can perform exact-repair of parity nodes as well, and such repair schemes are 



indeed designed and presented. The impossibility of constructing linear, scalar (i.e., (3 = 1) exact-MSR 



codes when d < 2k — 3 is shown in [10|, [11|. On the other hand, in the limiting case of B (and 



8 



hence a and (3) approaching infinity, the MSR point is shown to be achievable under exact-repair for 



all [n, k, d] in [14|, [15|. 



A general framework, termed the Product-Matrix framework, that enables construction for a wide 
range of parameters is introduced in p6| . Explicit codes at the MBR point for all values of the parameters 
[n, k, d] and at the MSR point for the parameter set [n, k, d > 2k— 2] are constructed in this framework. 
Also contained in the paper is a simpler description of the MISER code in the Product-Matrix framework. 

There has also been some work in the literature that consider slightly different models for distributed 
storage systems. Codes for more relaxed settings with respect to the data recovery /repair requirements 



are presented in |T7|, |T8j. The papers [|19j-[|22[ provide alternative frameworks for regenerating codes 
that introduce additional parameters for the system; tradeoffs between storage and repair are derived in 
each of the papers for the functional repair scenario. 

Following the initial presentation of the exact-MBR codes performing repair-by-transfer in [[8j| (de- 
scribed in Section [III] in the present paper), El Rouayheb et al. [|23| use the graphical approach to 



construct codes for a larger set of parameters, in a somewhat relaxed setting where a replacement node 
can connect to only certain fixed subsets of d nodes for repair ^\ They also provide upper bounds on 
the storage capacity of such systems. 



III. Explicit Exact-MBR Code for d = n — 1 with Repair-By-Transfer 

In this section, we provide an explicit construction of exact-MBR codes wherein the parameter d 
takes the largest permissible value of n — 1 £j These codes are capable of performing exact-repair of 
any failed node in such a way that the repair process is accomplished with mere transfer of data and 
without need for any arithmetic operations either at the helper nodes or at the replacement node. This 
property makes the code practically appealing. 

First we present a brief overview of the Minimum Bandwidth Regenerating point on the storage- 
bandwidth tradeoff and the concept of striping of data which will be used in the code construction. 



A. MBR Point Parameters 

The MBR point is as extreme point on the storage-bandwidth tradeoff that corresponds to the least 



possible repair bandwidth. As previously discussed in Section I-C the parameters a and (3 for the MBR 



3 ln this work, the authors refer to the repair-by-transfer property of codes in |8| as uncoded-repair. 

4 lt can be inferred from the storage-bandwidth tradeoff l|3j that, for fixed values of the parameters n, k, a and B, the repair bandwidth 
d/3 decreases with increase in d. 
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point satisfy 

a = d/3 

b = (kd-^^piy. (id 

Thus at the MBR point, a replacement node downloads no more than the number of symbols it eventually 
stores. 

At this point, we briefly digress to analyze a particular relation between the parameters (a, (3, B) 
of a regenerating code, that will aid in simplifying the system implementation of the code. 



B. The Striping of Data 

Given a set of parameters (£>, a, (3) satisfying the tradeoff (|2]) with equality, the parameters {a 1 = 
5a, (3' = 5/3, B' = 5B) for any positive integer 5 also satisfy the tradeoff (|2]) with equality. Thus, for 
some [n, k, d], an optimal regenerating code for (a', (3' , B') can be obtained easily by dividing the 
B' = 5B message symbols into 5 groups of B symbols each, and applying the optimal (B, a, (3) code 
to each group independently. In particular, if one can construct an [n, k, d] MBR code with (3 = 1, 
then one can construct an [n, k, d] MBR code for any larger integer value of (3 as well. 

From a practical standpoint, a code with smaller (3 will involve manipulating a smaller number of 
message symbols and hence will in general, be of lesser complexity. For these reasons, in the present 
paper we design codes for the simplest case of (3 = 1. In this situation, the values of a and B at the 
MBR point are given by 

a = d 
B = kd- 




C. An Example Code 



The example deals with the parameter set [n = 5, k = 3, d = 4], (3 = 1 which from equation ( fT2] ) 
gives a = 4 and B = 9. Let the 9 message symbols be denoted by < i < 9}. 

Encoding: Let C be a [10, 9]-MDS code, for example, C could be a single parity check code of 
length 10. Encode the 9 message symbols using the code C, and let the 10 coded symbols be denoted 
by {q 1 1 < i < 10}. Our code construction can be visualized via a fully connected graph (see Fig. [3]) 
on 5 nodes, each representing a distinct node in the network. To each of the 10 edges in the graph, we 
assign a distinct symbol from the set of code symbols {cj|l < i < 10} (in any particular order). 

In our construction, each storage node stores the 4 symbols assigned to the four edges incident on 
the node, as shown in Fig. [3] 
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V J 

Fig. 3: Graphical representation of an exact-repair code for the MBR point with [n — 5, d — 4, k — 3]. 

Data-Reconstruction: Supposing a data-collector connects to nodes 2, 3 and 4. The data-collector 
then recovers 3 x 4 = 12 coded symbols of which 12 - Q = 9 are distinct. Since the code C is an 
[10, 9]-MDS code by construction, the data-collector can recover the 9 message symbols from these 9 
coded symbols. 

Repair-by-transfer: Suppose node 3 fails, and is replaced by an empty replacement node. Under 
our construction, each of the four remaining nodes pass the symbol assigned to the edge that it has in 
common with node 3, i.e., nodes 1, 2, 4 and 5 pass on the symbols c 2 , c 5 , c 8 and c 9 respectively to 
the replacement node (from Fig. [3]). But these are precisely the four symbols that were stored in node 
3 prior to failure, and hence node 3 is exactly repaired. Note that the repair is accomplished by mere 
transfer of data and no arithmetic operations are required either at the helper nodes or at the replacement 
node. Hence we term this as repair-by-transfer, which is potentially useful in practical systems. 



D. Code Construction for the General Set of Parameters [n, k, d = n — 1] 

As discussed previously, the code is constructed for the case of f3 — 1, and codes for any higher 
value of (3 can be obtained via concatenation. The construction follows along the lines similar to the 
example provided in the previous section. 

Let the B message symbols be denoted by {rrii\l < i < B}. 

Encoding: Let C be an [(2), £>]-MDS code. Encode the B message symbols using the code C, and 
let the (2) coded symbols be denoted by {c, |l < i < Q)}- As in the example, we can visualize the code 
construction via a fully connected graph on n nodes, each representing a distinct node in the network. 
To each of the (2) edges in the graph, we assign a distinct code symbol from the set {q|l < i < (")} 
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(in any particular order). 

In our construction, each storage node stores the a (— n — 1) symbols assigned to the (n — 1) edges 
incident on the node. Thus, each symbol in {cj|l < i < (")} is stored in precisely two nodes. 

The following theorems establish the properties of data-reconstruction and repair-by-transfer. 

Theorem 1 (Data-Reconstruction): A data-collector can recover all the B message symbols by con- 
necting to any subset of k nodes. 

Proof: The data-collector connects to a subset of k nodes in the network, and recovers the ka (= kd) 
code symbols stored in these nodes. Since every pair of nodes has exactly one symbol in common, 
there are Q) redundant symbols among these kd code symbols. Thus, the data-collector has access to 
kd- (2) = B distinct code symbols from the set {q|l < i < (")}. Since the code C is an [("), 5] -MDS 
code by construction, the data-collector can recover the B message symbols {mj|l < i < B} from 
these B code symbols. ■ 

Note that in our construction, the data-reconstruction (decoding) procedure is identical to that of 
decoding an MDS code over an erasure channel. 

Theorem 2 (Repair-By-Transfer): Exact-repair of any failed node can be achieved by connecting to 
the remaining (n — 1) nodes and downloading the minimum possible data. Furthermore, the process 
involves mere transfer of data and does not require any arithmetic operations. 

Proof: On failure of a storage node, the replacement node connects to the (n — 1) remaining nodes. 
Each of the remaining nodes passes to the replacement node, the symbol assigned to the edge that 
it has in common with the failed node. By construction, these symbols are precisely the (n — 1) = 
a symbols that were stored in the node prior to failure. Thus, the replacement node simply stores 
these symbols, completing the process of exact-repair. Clearly, the repair process does not require any 
arithmetic operation either at the helper nodes or at the replacement node. ■ 



E. Size of the Finite Field 

The sole constraint on the field size required in the code construction arises from the need for the 
existence of an [("), B] MDS code. The existence of doubly extended Reed-Solomon codes tells us 
that a field size of (™) — 1 will suffice. 

Remark 1: The example employed a single parity check code as its MDS code. This has practical 
appeal since all operations can be carried out in binary field using only XORs. We note from the 



above that a single parity check code will suffice whenever k = n — 2, which from equation ( [12] ) gives 
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IV. Non-existence of Exact-Repair codes achieving the Interior Points on the 

Storage-Bandwidth Tradeoff 

We now move on to the second main result of the paper which proves the non-achievability of the 
interior points on the storage -bandwidth tradeoff under exact-repair. Originally, the storage -bandwidth 
tradeoff was derived for the case of functional-repair. However, given the clear advantages of exact- 
repair, much of the work in the field of regenerating codes has been dedicated to exact-repair. 

In particular, the two extreme points of the tradeoff (MBR and MSR) have been much studied, and 
the achievability of the tradeoff for exact-repair at the extreme points has been characterized to a large 



extent. Interference Alignment based explicit constructions in [10|-|12| achieve the cut-set bound at 
the MSR point for [n = d + 1, k, d > 2k — 1]. Product-Matrix framework based exact-repair code 
constructions in p6| cover all possible values of the parameters [n, k, d] at the MBR point, and all 
[n, k, d > 2k — 2] at the MSR point, thus proving the tightness of the storage-bandwidth tradeoff 



for these regimes. Further, in [10|, [11|, the authors show that there exist no linear, scalar (i.e., with 
(3 = 1) MSR codes performing exact-repair when d < 2 k — 3. In p4|, p3), the authors show that in the 



asymptotic case of a, /3 and B tending to infinity, the MSR point can be achieved under exact-repair 
for all possible [n, k, d]. 

However, the tightness of the storage-bandwidth tradeoff under exact-repair has remained open 
for the interior points. In the present section, we address this issue by proving the impossibility of 
constructing codes performing exact-repair at almost all interior points on the storage-bandwidth tradeoff. 
Furthermore, we identify scenarios, which we term as helper node pooling scenarios, that overconstrain 
the system. 

This section is organized as follows. First, an information-theoretic perspective for regenerating codes 



is presented in Section IV-A which is used throughout the present section. Next, the storage -bandwidth 



tradeoff (|2]) is re-derived information-theoretic ally in Section IV-B Section IV-C presents a convenient 
means of representing every point on the storage -bandwidth tradeoff as a function of parameters a and 
(3. Following this, a set of conditions imposed by the cut-set bound on the amount of information stored 



and passed by the nodes are derived in Section IV-D In addition, 'helper node pooling' scenarios are 



defined in this section. These properties are then utilized in Section FV-E to establish non-achievability 
of the interior points on the storage-bandwidth tradeoff for exact-repair. Finally, in Section IV-F[ an 
achievable curve is plotted via a simple storage-space sharing scheme between existing codes for the 
two extreme points. 



A. Notation 



While in earlier sections we worked with individual symbols from a certain alphabet, in keeping with 
the information-theoretic approach of the current section, we treat the message symbols as well as the 
data stored and passed by the nodes as random variables. 
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Under this information-theoretic perspective, the nodes in the network store data pertaining to a source 
(message) M, whose entropy is B, i.e., 

H(M) = B. (13) 

Next, we introduce the random variables, pertaining to the data stored in the nodes and the data passed 
by nodes for data-recovery and repair purposes. 

Let We denote the random variable corresponding to the data stored in node i (1 < i < n). We will 
assume that each storage node has a storage capacity of a and is hence incapable of storing variables 
whose entropy are greater than a, thus 

H(W e ) < a. (14) 



Consider exact-repair of node £ using a set V of d helper nodes, and let node m 6 V. In this situation, 
we denote the random variable corresponding to the data passed by the helper node m to aid in the 
repair of node £ by v S l m . We also assume that the data pipes used for repair have capacity /3 and hence 
are incapable of carrying variables whose entropy are greater than f3, i.e., 

H( v S e J < p. (15) 



Both equation ( |T4| ) and equation ( [15] ) are in keeping with the original setting where each node had the 
capacity to store a symbols and each data pipe used in repair had the capacity to carry (3 symbols. 

Note that since repair is exact, the random variables We and x>^m are invariant with time, i.e., they 
remain constant irrespective of the sequence of failures and repairs that occur in the system Q 

A little thought shows that the reconstruction and exact-repair requirements can be stated information- 
theoretically as follows. 

(i) From the reconstruction property required of a regenerating code, it must be that, for every subset 
of k storage nodes: {£{ \ 1 < i < k}, we need 

f(m|{^}Li) =0- (16) 

(ii) Similarly, the exact-repair property requirement leads to the condition that for every node £ (1 < 

£ < n) 



HIW, 



{^i} me J=0. (17) 



where V represents the set of d helper nodes participating in the exact-repair of node £. 

In the sequel, for simplicity, we will drop the left subscript V and the set of d nodes participating in 
the repair process will be clear from the context. Furthermore, since a node can pass only a function 

5 In contrast, functional repair permits a replacement node to store data different from that stored in the failed node, thus leaving open 
the possibility that these variables are dependent on time. 
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of what is stored in the node, it follows that, for any node m, 

H(S e m \W m )=0. (18) 

Remark 2: Throughout this section, we will assume that all the random variables are functions of 
the message M. This is without loss of generality since one can always assume a genie that reveals 
all the extraneous sources of randomness to every entity in the system, and this would still retain the 
necessity of the properties proved here. However, for convenience, we do not indicate the dependence 
in the notation. Thus, we have 

H(W t \M) = and H(S e m \M) = 0. (19) 



Now, from equations (13) and (19), one can rewrite the reconstruction property in (16) as 



# (Wjjli) =B. (20) 

Next, we set up notation to denote certain sets of random variables which will be used frequently. 
Let A denote a collection of storage nodes. Then the set of random variables corresponding to the data 
stored in the nodes in A is denoted by 

W A ^{W i } ieA . (21) 
Further, define [m] as the set of numbers {1, . . . , m} for some positive integer m, and denote 

W[ m ] 4 {Wi}™! ■ (22) 
Note that the notation [0] will correspond to an empty set. 

The random variables corresponding to the data passed for repair may be grouped in two ways. 
Denote the collection of random variables passed by nodes in set A to assist in the repair of a particular 
failed node t by 

^=KU- (23) 

On the other hand, across the instances of failure of every node in set A, the collection of variables 
passed by a particular helper node m to each of these nodes respectively is denoted by 

Sjn = { } i^A ■ (24) 

Note that in both the cases above, the other helper nodes participating in the repair process will be clear 
from the context. 
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B. Information-Theoretic Derivation of the Tradeoff 

We now present an information-theoretic derivation of the storage-bandwidth tradeoff (|2]), since it is 
convenient to remain in the information-theoretic domain throughout this section. The results established 
in this subsection are derived only for the case of exact-repair. The extensions to the case of functional- 
repair are straightforward and are explained subsequently. 

The following lemma establishes a relation between the information stored in various nodes. 

Lemma 3: For an arbitrary node I, and an arbitrary subset A consisting of a (0 < a < d) nodes such 
that £ £ A, 

H(W e \S e A ) < min(a, (d - a)/3) (25) 

and hence 

H(W t \W A )< mm(a, (d - a)B). (26) 

Proof: Consider exact-repair of node £ by connecting to the a nodes in set A and (d — a) other 
arbitrary nodes. Denote the set of these (d — a) helper nodes by B. Then, the exact-repair of node £ 
requires 

= H(W t \S' A , S B ) (27) 

= H(W i \S A )-l(W e ;S B \S A ) (28) 

> H(W e \S e A ) -H(S e B ) (29) 

> H (Wt\S A ) -(d-a)p (30) 

> H(W e \W A )-(d-a)P , (31) 

where equation ( [3Q| ) follows since each of the (d — a) helper nodes in B can pass at most (3 units of 
information to the replacement node, and equation pTj ) is a result of the fact that a node can only pass 
a function of what it stores, i.e., H(Sf n \W m ) = for all nodes m E A. Equation ( pTj ), coupled with the 
constraint on the storage capacity of the nodes (i.e., H(We) < a) leads to the desired result. ■ 

Remark 3 (The case of functional-repair): In the case of functional repair, the data stored in a node 
after repair need not be identical to that stored in it prior to failure. This makes the corresponding 
random variable a function of time. In this scenario, the lemma above applies when We, is the random 
variable corresponding to the data stored in node £ after being repaired with the help of a nodes in A 
and (d — a) other arbitrary nodes. 

The next theorem gives a simple derivation of the storage-bandwidth tradeoff ([2]) for exact-repair 
from an information-theoretic perspective. 
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Theorem 4: Any [n, k, d], (B, a, (3) regenerating code must necessarily satisfy 



fc-i 



B < ^min{a, (d -£)$}. (32) 



1=0 



Proof: The reconstruction property (recall equation p0| )) requires 



B = H(W [k] ) (33) 



fc-i 



= (34) 

e=o 
fc-i 

< ^min{a, (d -£)$}, (35) 



where equation p5| ) follows from Lemma [3] ■ 

Remark 4 (The case of functional-repair): The above theorem holds for the case of functional-repair 
as well. Here, a sequence of failures and repairs is considered, starting from node 2 through node k (in 
this order). The d nodes assisting node £ + 1 (1 < £ < k — 1) in its repair include the £ nodes in set 
[£}, and is the random variable corresponding to the data stored in node £ after its repair. 

Next, we present a convenient way to represent all points on the storage-bandwidth tradeoff in terms 
of a and /3. 



C. Representation for the Points on the Tradeoff 



As previously discussed in Section I-C for any [n, k, d] optimal regenerating code, the parameter 
a lies in the range 

(d- (k- l))/3 < a < d/3. (36) 
Based on this, for a given value of (a, (5), we depict the relation between the two parameters as 

a=(d-p)P-9, (37) 



for some p and 6 with p G {0, . . . , k — 1} and 9 G [0, 0). Note that the range of a in ( |36| ) implies that 
for p = — 1, it must be that 9 = 0. 

The storage-bandwidth tradeoff can thus be partitioned into the two end-points and a middle region: 

1) The MSR point: p = k - 1 (which implies = 0) 

2) The MBR point: p = 0, = 

3) The Interior points: p G {0, . . . , k - 2}, G [0, /3) except {p = 0, = 0}. 

For instance, the values of (a, j3, p, 9) at the four points depicted on the storage-bandwidth tradeoff 
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in Fig. [2] are 

MSR : (2700, 300, 9, 0), 

IPi : (2786, 250, 6, 214), 

IP 2 : (3300, 204, 1, 168), 

MBR: (3600, 200, 0, 0). 



D. Properties of Exact-Repair Codes 

We now present a set of properties that any exact-repair code with parameters satisfying the storage- 
bandwidth tradeoff with equality Q, must necessarily possess. These properties pertain to the random 
variables stored by the nodes and those passed for exact-repair. The proofs of these properties are 
relegated to Appendix |A} 

The first two properties provide insights pertaining to data stored in the nodes, and the subsequent 
properties provide insights about the data passed for repair. 

Property 1 (Entropy of Data Stored): For an arbitrary storage node £, 

H(W l )=a. (38) 

Property 2 (Mutual information among the nodes): For a set A comprising of an arbitrary collection 
of a nodes, and an arbitrary node £ A, 

a < p 

I (Wf, W A )= { (a -p)/3-e p<a<k (39) 

a a > k . 

Note that in Property [2] above, a threshold effect manifests itself twice in the mutual information, the 



first threshold occurring at a = p + 1 and the second at a — k. This is illustrated in Fig. 4a This is a 
phenomenon similar (albeit more complex) to the single threshold effect in MDS codes where a code 
symbol has zero mutual information with upto (k — 1) other code symbols and has mutual information 
equal to its entropy with any k or more other code symbols. 

Further, note that with respect to the amount of mutual information between a single node and a 
bunch of nodes, the two extreme points behave differently as compared to the interior points. This is 



depicted in Fig. 4b and Fig. 4c As it turns out, it is this difference in behaviour that leads to the 



non-existence for the interior points. 

Corollary 5: Consider a set A comprising of an arbitrary collection of a < k nodes. In the situation 
where the set of d helper nodes assisting in the repair of an arbitrary node £ ^ A include the a nodes 
in A, it must be that 

H(W £ \S e A ) = min{a, (d - a)/3}. (40) 
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Fig. 4: The amount of information that a node t has in common with a set A of a other arbitrary nodes 
(Property |2j): for (a) a general point on the tradeoff, (b) the MBR point, and (c) the MSR point. 



Property 3 (Entropy of Data Passed): In the situation where node m is an arbitrary helper node 
assisting in the repair of a second arbitrary node £, it must be that 

H {S e m ) = (3, (41) 

irrespective of the identity of the (d — 1) other helper nodes. 

Helper Node Pooling: Regenerating codes permit a failed node to choose an arbitrary set of d 
remaining nodes to aid in its repair. In particular, this includes situations where nodes form a pool 
and help each other in the repair process. More formally, consider a set T consisting of a collection of 
f < (d + 1) nodes, and a subset 1Z of the set T consisting of r nodes. We refer to 'helper node pooling' 
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V(Q 




T T T 

Fig. 5: Helper node pooling, and the setting of Properties [4] and [IJ 




as a scenario where on failure of any node £ G TZ, the d helper nodes assisting in its repair include the 
(/ — 1) remaining nodes in T . We denote the remaining (d — (f — 1)) arbitrary helper nodes assisting 
in the repair of node t by V{£) f\ The helper node pooling scenario is illustrated in Fig. [5] 

Regenerating codes must necessarily satisfy helper node pooling scenarios. This leads to surprising 
(and as we shall see, implausible) upper bounds on the amount of information passed by a single helper 
node in the pool to multiple replacement nodes. In the following two properties, S^ is used to denote 
the random variable corresponding to the data passed by node m G FXJZ to assist in the repair of node 
i G TZ in the scenario where the d helper nodes are {(^\{^}) U V(£)} H . Further, in agreement with 
our earlier notation, we define 



qTl A f o£ \ 



Property 4: In the helper node pooling scenario where 

min{fc, /} > p + 2 > r, 
for an arbitrary node m G J-'XJZ it must be that 

H {Si) < 2(3-9. 
Property 5: In the helper node pooling scenario where 

min{A;, /}>p+l>r>2, 
for an arbitrary node m G FXJZ and an arbitrary pair of nodes {£i, £ 2 } G TZ, it must be that 

ff(5£ | St) < 9 



(42) 



(43) 



This notation is in anticipation of the usage of these sets in Properties 0and[5] which consider the repair of each of the nodes in 7?.. 
In the scenario considered, every replacement node in TZ connects to the remaining (/ — 1) nodes in T, and hence T forms a fixed set 
of helper nodes. On the other hand, the set V(£) comprising of the (d — (/ — 1)) helper nodes of node £ £ TZ is specific to node £ and 
is allowed to vary with I. 

7 The set V{t) representing the (d— (/ — 1)) arbitrary helper nodes assisting in the repair of node i plays no role in the properties or 
the proof. Hence, for ease of understanding, the reader may choose to assume this set also to be fixed, i.e., V(£) = V. 
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and hence 

H (S%) < + (r - 1)9. (44) 

Note that Properties [4] and [5] do not hold for the MSR point (which has p — k — 1) and are trivially 
satisfied for the MBR point (which has p = 0). 

E. The Non-existence Proof 

We now show that the properties derived in the preceding subsection over-constrain the system, 
causing a majority of the points on the storage-bandwidth tradeoff to be non- achievable under exact- 
repair. Recall that the parameters (a, (3) at any point on the tradeoff are written as 

a = (d-p)p-9, 

with the interior points corresponding to 

p E {0, . . . , k - 2} and 9 e [0,(3), except for the point {p = 0, 9 = 0}. 

We first consider the case when a is a multiple of (3, in Theorem |6j A majority of regenerating 
schemes and code constructions in the literature [|7|-p3|, [ fT6| , [ fT7| are designed for this case of a 
being a multiple of (3. Thus for this case, at the interior points it must be that 

a = (d-p)(3, 9 = 

with p lying in the range 

1 < P < k-2. (45) 

Theorem 6: For any given values of B and [n, k, d), exact-repair codes do not exist for the parameters 
(a, (3) lying in the interior of the storage-bandwidth tradeoff when 9 = 0. 

Proof: The proof is by contradiction: given values of B, [n, k, d] and (a, (3) such that 9 = 0, 
we assume the existence of an exact-repair code with these parameters and show that the code fails to 
satisfy some of the properties of exact-repair codes. 

Let Q denote the distributed storage network under consideration, and further let T denote an arbitrary 
sub-network of Q consisting of (d + 1) or greater nodes [^J Given an optimal exact-repair code for the 
network Q, it is clear that the code is also an optimal exact-repair code for the sub-network T , with the 
same parameter values k, d, B, a, (3. In the present proof, we restrict our attention to a sub-network 
T consisting of precisely [d + 1) nodes. 

8 As the notation suggests, in this proof, T plays the role of the fixed set in the helper node pooling scenario. 
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A brief outline of the proof is as follows. Clearly, the nodes in T form a helper node pool, and 
hence Property [5] can be used to upper bound the total amount of information that a node can pass to 
aid in the repair of a bunch of nodes. This limits the cumulative information received by the nodes in 
T during their respective repair operations, which in turn limits the total data B stored in the network 
to (d + 1)(3. Finally, the value of B determined by the storage-bandwidth tradeoff (|3]) is found to be 
strictly larger. 

Since the sub-network T consists of (d+ 1) nodes, a failed node £ G T is repaired with the assistance 
of the d remaining nodes in T . Thus for any node I e J 7 , the exact-repair property requires 



0. 



Also, for any three distinct nodes {m, £ 1; £ 2 } £ J 7 , Property [5] when 9 = implies that 



(46) 



(47) 



Now, for a data-collector to be able to recover the entire data by connecting to a set of k nodes /C C T 
it must be that 



B 





H{W K ) 


(48) 


< 




(49) 






(50) 


< 


H {{ S T\{t}} leJ ) 


(51) 






(52) 


< 


E H ( s ^ {m} ) 


(53) 








< 




(54) 










(d + l)0 . 


(55) 



Here, equation ( |50] ) follows from the exact-repair requirement stated in equation (j46J). Equation ( [52] ) is 
a re-writing of equation ( |5"T| ), and equation < f54] ) employs equation ( |47l ) and Property [3j 



On the other hand, since any optimal regenerating code must satisfy the storage-bandwidth tradeoff ([3]), 
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it must be that 



fc-1 



B 



yj min (a, (d — i)0) 
fc-i 

]Tmm((d-p)/3,(d-z)/3) 



i=0 



fc-1 



2(d - p)P + min - P)P, ( d ~ i)! 3 ) 



i=2 



(56) 
(57) 

(58) 

(59) 
(60) 



where equation ( |57] ) holds since a = (d — p)/3, equation ( [58] ) holds since p > 1 (see equation ((45])), and 



equations (59) and (60) are derived using d > k > p + 2. This is in contradiction to equation (55). 



Theorem 7: For any given values of B and [n, k, d], exact-repair codes do not exist for the parameters 
(a, (3) lying in the interior of the storage-bandwidth tradeoff when 9^0 except possibly for the case 

/ d — p — i 

p = k - 2 with either 6 > — (3 or k = 2 

V d -P 

Proof: The proof of this theorem also exploits the existence of helper node pools in the system. 
Please refer to Appendix [B] for the proof. ■ 



Remark 5: It can be verified that the properties derived in Section IV-D and the non-achievability 
results in the present section continue to hold even if optimal exact-repair of only k of the nodes is 
desired, and the remaining n — k nodes are permitted to repair functionally with no restriction on the 
repair bandwidth. 



F. An Achievable Curve via Storage Space Sharing 

We have seen that for a majority of the points in the interior of the tradeoff curve, the cut-set bound 
cannot be achieved under exact-repair. On the other hand, from the codes provided in flT6| , the cut-set 
bound can be achieved at the extreme points of the tradeoff curve: for all [n, k, d] at the MBR point 
and for [n, k, d > 2k — 2] at the MSR point. A linear storage-space-sharing scheme between these 
codes can be used to establish an achievable region in the interior of the tradeoff curve. Given the 
system parameters [n, k, d] and (B, a), with d > 2k — 2, the net repair bandwidth d(3 required under 
this scheme can be computed as 

2B-ka 

dp = k(d-k + iy m 

Fig. [6] depicts the curve achieved via storage-space-sharing alongside the storage-bandwidth tradeoff 
curve for the parameters [n > 18, k = 10, d = 18, B = 27000]. 
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Fig. 6: An achievable value of repair bandwidth d/3 for exact-repair of all nodes plotted alongside the 
storage repair-bandwidth tradeoff curve, which is a lower bound on the repair bandwidth. 



V. Conclusion 

In this paper, an explicit exact MBR code for the parameters [n, k, d = n — 1] is presented. This 
code has very low repair complexity; repair of a failed node can be achieved by mere transfer of 
data and does not require any computation. Moreover, this code, when specialized to the parameter set 
[n, k = n — 2, d — n — 1], can be constructed over the binary field, and can be implemented using 
solely XOR operations. 

A set of properties that any exact-repair code must necessarily satisfy are derived. Specific scenarios 
termed helper node pooling are identified, which lead to upper bounds (that are surprisingly small) on 
the amount of information that a node can pass to assist in the repair of a bunch of nodes. These upper 
bounds are then used to show the non-achievability of almost all interior points on the storage -bandwidth 
tradeoff under exact-repair. 
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Appendix A 

Proofs of the Properties of Exact-Repair Codes 



Proof of Property [/J Without loss of generality, assume 
data-collector connecting to the first k nodes, it must be that 



B 



H (W [k] ) 

H(W 1 )+J2H(W j+l \W m ) 



k—1 



1. Now, for reconstruction by a 

(62) 
(63) 



j=i 



k-l 



< a + J2H(W j+1 \W m ) 

i=i 
fe-i 

— a + minjo:, (d — j)(3} 



3=1 

k-l 

^min{a, (d-j)/3} 

3=0 

B. 



(64) 

(65) 

(66) 
(67) 



Here, equation ( |65] ) results from Lemma [3j equation ([66]) uses the fact that a < d(3 (equation ([36])), and 
equation ( [67] ) follows since we need to satisfy the storage-bandwidth tradeoff with equality Q. Thus, 
equation ( |64] ) must be satisfied with equality, which forces H{W\) = a. ■ 

Proof of Property |2] - The result clearly holds when a> k since (i) data contained in any k nodes 
suffice to recover the entire data, and (ii) H(We) = a (from Property [T]). 

Now for the case when a < k, without loss of generality, we assume that the set A comprises of 
the first a nodes in the system and node £ is the (a + \) th node, i.e., A = [a] and I = a + 1. For 
reconstruction by a data-collector connecting to the first k nodes, we need 



B 



< 



H (W [k] ) 

k-i 

k-l 

^min{a, (d-j)0} 

j=0 

B. 



(68) 
(69) 

(70) 
(71) 



where equation ( f70] ) follows from Lemma [3] and and equation ( fTT| ) is a result of satisfying the storage- 
bandwidth tradeoff with equality ([3]). Thus, equation ( f70] ) must be satisfied with equality. This, coupled 
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with the lower bound on each term H(Wj +1 \Wy]) from Lemma [3| gives (for the choice of j as a) 

H (W a+1 \W [a] ) = min (a, (d - a)/3). (72) 

Hence, 

I (W a+1 ;W [a] ) = H(W a+1 ) - H(W a+1 \W [a] ) (73) 

= a — min (a, (d — a)(3) (74) 

= (a-(d-a)P) + (75) 

= ((a-p)(3-9) + , (76) 

where equation f74] ) follows from Property [T] and equation f72] ). ■ 
Proof of Corollary [5J- From Lemma [3j it must be that 

H(W e \S e A ) < min{a, (d - a)/3}. (77) 

On the other hand, since H(Sf n \W m ) = for all node m E A, 

H(W £ \S e A ) > H(Wi\W A ) (78) 

= H(W e ) - I(Wf,W A ) (79) 

= a - {a - (d - a)(3) + (80) 

= min{a, (d — a)(3}, (81) 

where equation ( [80] ) employs Property [T] and Property |2] with a < k. ■ 

Proof of Property ^ Partition the set of <i helper nodes assisting in the repair of node £ into a set 
.4 consisting of [k — 1) nodes and a second set i3 consisting of the remaining (d — k + 1) nodes, such 
that node m G B. Then, Corollary [5] mandates that 

H(Wt\S A ) = (d-k + l)p. (82) 

However, exact-repair of node I requires 

H(W e \S A ,S e B ) = 0. (83) 



Equations ( |82[ ) and ( |83[ ) imply that 

# > (d - k + 1)13. (84) 



Noting that each of the (d — k + 1) helper nodes considered in equation ( [84] ) can pass no more than (3 
units of information, it must be that 

H (Sj) = (d - k + 1)13, (85) 
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from which it follows that 

H (S g m ) = f3. (86) 



Proof of Property f?| Clearly, if the statement holds for some values of / and r, it continues to 
hold for all /', r' when f > f and r' < r. Hence, throughout the proof, the set 1Z is assumed to be 
comprised of r = p + 2 nodes, and the set T is such that T = 1Z U {m}. Thus, / = p + 3. 

Consider repair of an arbitrary node i E 1Z where the set of d helper nodes includes node m and the 

(p + 1) remaining nodes in 1Z. As an intermediate step, we wish to to prove H (S^J Wn) = 0. For 
this, consider, 

l(St;W n ) = I (S e ml W e ,W nm ) (87) 

= I (S £ m ,W nm ) + 1 (Si-,W E \W nm ) (88) 

> l{Si;W £ \W n \ {e} ) (89) 
= H(W e \W nm )-H(W e \Wn\{i } ,Si) (90) 

> H(W e \Wn\ { e})-H(We\Si m ,S e m ) (91) 
= (d-p-l)p-(d-p-2)p (92) 
= P, (93) 



where equation ( |9Tj ) follows since S^v r« is a function of VF^^} and equation ([92]) follows from 



Property [2] and Corollary [5] with r = p + 2 < k. Then it must be that, 

H(S l m \W n ) = H (S & m ) - I (S l m -Wn) (94) 

< P-P (95) 

= 0, (96) 

and hence 

H(S e m \W n )=0. (97) 



Now, since the choice of node I from the set 1Z was arbitrary, equation ( [97] ) holds for all £ G TZ and 
hence 

£T(5^|W W ) = 0. (98) 

It follows that 

H{Sl) = I{Sl;W n ) (99) 

< I(W m ,W n ) (100) 

= 2/3-9, (101) 



28 



where equation ( |101| ) follows from Property [2} ■ 

Proof of Property^ The steps followed in this proof are similar to those in the proof of Property [4] 
Clearly, if the statement holds for some values of / and r, it continues to hold for all /', r' when f > f 
and r' < r. Hence, throughout the proof, the set TZ is assumed to be comprised of r = p + 1 nodes, 
and the set T is such that T = TZ U {m}. Thus, f — p + 2. 

Consider repair of an arbitrary node i G TZ where the set of d helper nodes includes node m and 
the p remaining nodes in 71. As an intermediate step, we wish to to prove H (S^ Wn) < 9. For this, 
consider 



l{S e m ,W n ) = I {S £ m ,W e ,W nm ) 

= l{S e m ,Wn\ { e } )+l{S e m ,W £ \W nm ) 

= H(W e \W n \ {ey )-H(W e \W n \ {e} ,S l m ) 

> H(W,\W n \ {l] )-H(W,\S l nmi S l m ) 

= (d-p)P -9 - (d-p-l)p 

= P-e, 



(102) 
(103) 
(104) 
(105) 
(106) 
(107) 
(108) 



where equation ( |106| ) follows since S^UB 1S a f unct i° n °f Wn\{(} and equation ( |107[ ) follows from 
Property |2] and Corollary [5] with r = p + 1 < k. Then, it must be that 



H(S e m \W n ) = H (S e m ) - I (Si;W n ) 
= 9. 



(109) 
(110) 
(HI) 



Since the choice of node i from the set 1Z was arbitrary, the above equation holds for all t E TZ. 
Next, we prove H(Si\S^) < 9 for an arbitrary pair of nodes {£±, £2} G TZ. For this, consider 

H(StSt) = I(W n ;StSi) + H(StSi\Wn) (112) 

< I{W n] W m ) + H{Si,S%\W n ) 
= I(W n ;W m ) + H(Si\W n ) + H 

< (P-9) + 9 + 9 
= + 9, (116) 

where equation ( 113 ) follows since 3.nci are functions of W„ and equation \\\5) follows from 



(113) 
(114) 
(115) 



29 



Property [2j and equation < |111| ). Then it must be that, 

H(Sm\Sm) = H(Smi ^m) ~ H(Sm) (117) 

< (j3 + e)-p (lis) 

= e. (ii9) 



Finally, ordering the nodes in 1Z in an arbitrary manner as {£i \ 
equation ( |1 18| ) holds for every pair of nodes in 1Z, we have 

H{Sl) < H{S^) + j^H{S^\S^: 

i=2 

< (3 + {r-l)6. 



1 < i < r} and noting that 

(120) 
(121) 



Appendix B 
Proof of Theorem [7] 

Proof: The proof is by contradiction: given values of B, [n, k, d] and (a, (3) such that 9 ^ 0, 
we assume the existence of an exact-repair code with these parameters and show that the properties of 
exact-repair codes lead to a contradiction. 

As in the proof in Theorem [6} we restrict our attention to a sub-network T of the distributed storage 
network Q consisting of (d+ 1) nodes, and ignore the remaining nodes in Q. Thus, on failure of a node 
in this sub-network J 7 , the d helper nodes comprise of the d remaining nodes in T . 

Let {£, m} be a pair of nodes in the sub-network T . Partition the [d — 1) remaining nodes in T into 
two sets, namely A of cardinality p and B of cardinality (d — p — 1). 

Since the sub-network T consists of (d + 1) nodes, a failed node £ G T is repaired with the assistance 
of the d remaining nodes in T . Exact-repair of nodes t and m requires 

H(W i \S%SlS e m ) = 0, (122) 
H(W m \S%,S?,S?) = 0, (123) 



respectively. However, since S 1 ™ is a function of Wi for every helper node i, equations ( |122[ ) and < |123| ) 
lead to 

H(W e ,W m \W A ,S e B ,S%,S e m ) = 0. (124) 
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Now, 



H(SiS%,Si) > I(S B ,S%,S e m ;W e ,W m \W A ) 

> H(W e ,W m \W A ) 

= H(W t \W A ) + H(W m \W A ,W e ) 

= H(W £ )-I(W e ;W A ) + H(W m )-I(W m] W A ,W e 

= a-0 + a-(p-6) 

= (2d-2p-l)P-9, 



(125) 
(126) 
(127) 
(128) 
(129) 
(130) 



where equation ( |126[ ) follows from equation ( |124| ), and equation ( |129[ ) follows from Property [T] and 
Property |2j 

Next, we obtain an upper bound on the quantity H S 1 ^, S^J . We consider the case of p + 2 < k 
and the case of p + 2 = k separately. 

Case 1: p + 2 < k 
In this case, 



H(SiSZ,Si) < +H(S t m ) 

)+p 



= (2d-2p-l)p-(d-p-l)9. 



(131) 

(132) 
(133) 



where equation (132) follows from Properties [3] and |4j Since 9^0 and d > k > p + 2, equations ( |130| ) 
and ( |133[ ) are in contradiction. 

Case 2: p + 2 = k 

In this case, Property [5] is used to obtain an upper bound on H (Sg, Sg\ S^J. Note that this property 
does not hold when k = 2, and hence we consider the case when k > 2. 



H (S e B , Sq, S e m ) < y^iJ (Sf , S™) + # (5*^) 



< + + 



(d-p)/? + (d-p- 1)0. 



(134) 

(135) 
(136) 



where equation ( |135[ ) follows from Properties [3] and [5} Clearly, equations ( |136[ ) and ( |130[ ) contradict 
when 

« < (137) 

a — p 



